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ABSTRACT 

In the context of measuring structure and morphology of intermediate redshift galaxies with recent 
HST/ACS surveys, we tune, test, and compare two widely used fitting codes (Galfit and Gim2d) 
for fitting single-component Sersic models to the light profiles of both simulated and real galaxy 
data. Our study focuses on the Gems survey with the sensitivity of typical HST survey data, and we 
include our final catalog of fit results for all 41,495 objects detected in Gems. We test the reliability 
of both codes using simulated galaxy profiles constructed to represent the imaging characteristics of 
Gems. We find that fitting accuracy depends sensitively on galaxy profile shape. Exponential disks 
are well fit with Sersic models and have small measurement errors, whereas fits to de Vaucouleurs 
profiles show larger uncertainties owing to the large amount of light at large radii. We find that both 
codes provide reliable fits and little systematic error, when the effective surface brightness is above 
that of the sky. Moreover, both codes return errors that significantly underestimate the true fitting 
uncertainties, which are best estimated with simulations. We find that Gim2d suffers significant 
systematic errors for spheroids with close companions owing to the difficulty of effectively masking 
out neighboring galaxy light; there appears to be no work around to this important systematic in 
Gim2d's current implementation. While this crowding error affects only a small fraction of galaxies 
in Gems, it must be accounted for in the analysis of deeper cosmological images or of more crowded 
fields with Gim2d. In contrast, Galfit results are robust to the presence of neighbors because it 
can simultaneously fit the profiles of multiple companions thereby dcblcnding their effect on the fit to 
the galaxy of interest. We find Galfit's robustness to nearby companions and factor of > 20 faster 
runtime speed are important advantages over GIM2D for analyzing large HST/ACS datasets. 
Subject headings: methods: data analysis — catalogs — surveys — galaxies: general — galaxies: 
photometry — galaxies: statistics 



1. INTRODUCTION 

One of the central goals of observational exploration 
of galaxy evolution is to understand how the structures 
of galaxies evolve with cosmic time. A powerful tool 
in this context are large look-back surveys, where the 
time evolution of the distribution of galaxy structural 
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properties can be quantified. The key to unlocking the 
potential of these surveys is the development of quantifi- 
able, well-understood, and repeatable ways to measure 
and describe galaxy structures. Using such met hods, 
the evolution of the structure of disk galaxies (iLillvj 
1 19981: ISimard k. Pritchetl [l998t iRavindranath et"atl 
booi Barden et alJ 120051 : iTruiillo fc Pohlenl 120051 : 
ISargent et al.l 120061 ) and spheroid-dominated g a laxies 



. iSchade et al l I1997L Il999t iMdntosh et al.l 120051 : 
iTrujillo et al.ll2004L 120061 ) has been quantified over the 
last 10 billion years of cosmic time, since z — 3. In this 
paper, we exhaustively test an d tune two param etric 
galaxy fitting codes, Galfit (|Peng et al.l l2002f ) and 
Gim2d jS imardl 120021 ) . that are commonly used in the 
literature. With these tests we determine the best fitting 
setups for each code, quantify the sources of random 
and systematic uncertainty, and present s parametric fits 
for 41,495 objects in the HST Gems (|Rix et al.l l200l 
dataset. 

There are two main approaches towards describing 
galaxy structure from the two-dimensional information 
contained in image data. Non-parametric methods pro- 
vide estimates of total brightness, galaxy half-light size, 
and structure, using metrics which do not depend on a 
galaxy, having a structur e well-described by any particu- 
lar functional form (e.g. . lPetrosianllT976l : I Abraham et al.l 
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1996 ; Bershady, Jangrcn & Conselici l2000t lLotz et all 



20061 ) . The main disadvantages of non-parametric meth- 



ods are that they are reasonably sensitive to the depth of 
the images; because there is no parametric form for ex- 
trapolating to account for the faint outer parts of galax- 
ies, one can underestimate flux and/or size in poorly- 
posed cases ([Blanton et al. I [20031 ). Parametric methods, 
in contrast, choose particular functional forms (some- 
times reasonably complicated) with which to fit the 
galaxy light distribution. These have substantially less 
flexibility than non-parametric fitting codes, but have 
the advantage that light at large radii can be accounted 
for reasonably well by the natural extrapolation of the 
best-fitting model profile (under the assumption that the 
parametric form chosen does, in fact, describe the light 
profile in the outer parts of galaxies reasonably well) . Be- 
sides robust estimates of galaxy size, parametric methods 
provide measures of galaxy structure that may shed light 
on relative contributions of physically distinct and mean- 
ingful components such as spheroids, disks, and stellar 
bars. 

One particularly useful and flexible pr ofile for para- 
metric galaxy fitting is a single-component ISersid (|1968l ) 
model, which describes the radial surface brightness pro- 
file of a galaxy by the Sersic function given by 



S(r) = E e • exp [- K (( r /r e )^ n - 1)], 



(1) 



where r e is the radius of the galaxy (Note that for a Sersic 
fit r c is equivalent to the half-light radius ?"5o), S e is the 
surface brightness at r e , and the Sersic parameter n de- 
scribes the profile shape (the parameter k is closely con- 
nected to n) . Together with position (x and y) , axis ratio 
b/a and position angle, this profile has 7 free parameters. 
The Sersic profile represents a more general form of the 
exponential light- profiles seen in galactic disks (n = 1) 
and the i? 1 / 4 -law (de Vaucouleurs law) pr ofiles typical of 
luminous early-type galaxy (n ~ 4) (e.g. . I de Vaucouleurs! 
ll948tlFreemanlll970t) ; fitting with this profil e has been ex- 
plore d in detail in a numbe r of works (e.g. , Simardlll998l . 
[20021: iGraham et all [20051 : iTruiillo et all boOlh . Fig- 
ure Q] shows some examples of Sersic profiles with dif- 
ferent n. Many authors have used a constant value 
of n = 2.5 or n — 2.0 to crudely distinguish early- 
type (bulge-do minated) from l a te-type (disk-dominated) 
galaxies 



e. g.. iBlanton et al.l 120031: iShen et al.l 
Hogg et all 120041: iBell et all l2004t fearden et all 



2003 



2005 



Mcintosh et al.l l2005t iRavindranath et all l2004f h Fur 

thcrmore, fitting galaxies with a Sersic profile gives an 
estimate of size, and therefore is very useful for the ex- 
amination of the evolution of galaxy scaling relations. 

The goal of this paper is to describe our efforts to op- 
timize the estimation of single-compon ent Sersic profil e 
fits to the galaxies in the Gems survey (|Rix et al.H200l . 
To date, this has been our primary method for quantify- 
ing galaxy structure 12 . We compare the performance of 
the Galfit and Gim2d automated galaxy fitting codes, 
which ar e suitable for fitting large datase ts such as Gems, 
Stag es (jGrav et al! in prep.), Go ods (jGiavalisco et al I 
I2004T) and Cosmos (|Scoville et all ). We address the re- 
liability and limitations of these codes through thorough 
testing, using simulated and real galaxies. We describe 

12 Bulge-disk composite galaxies were not simulated for this pa- 
per; bulge-disk decomposition will be addressed in a future paper. 
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Fig. 1. — This plot shows Sersic profiles for different values of 
the Sersic index n, normalized to have the same flux at r e . One 
can see that profiles with high Sersic indices n > 2 have more flux 
at larger radii; thus, a good estimate of the background sky level 
is particularly important for precise fitting of high-n galaxies. 



the details of the simulations used throughout this paper 
in 33 In 521 we explore different set-ups, converging on 
'best-fitting' set-ups for each fitting code, ^summarizes 
the results from our testing of these 'best-fitting' set-ups 
using both simulated galaxies ( §4.11 and §4.21) and real 
galaxies ft j4.3[) . We compare our findings with those of 
a recent paper on the same topic (using the code s Gal- 
fit, Gim2d, and Gasphot) by iPignatelli et al.l (|2006f ) 
in §31 and publish a catalog of Galfit fitting results for 
all 41,495 detected galaxies from the Gems survey in §13 
All results from this paper and a number of other cat- 
alogs and images useful for testing galaxy fitting codes 
are presented on the Gems webpage 13 . 

2. SIMULATIONS 

Galaxy simulations are an invaluable tool for un- 
derstanding the performance of quantitative fitting 
pipelines. In this section we describe the set of simula- 
tions that are extensively used for this paper; the results 
obtained from fitting these simulations are discussed in 

§3 

In this paper, we focus on simulations of two different 
galaxy light profiles: purely exponential profiles (n = 1) 
representing the luminosity profile of a galactic disk (we 
will call these galaxies 'disks' throughout this paper), 
and Sersic profiles with a Sersic index of n = 4 repre- 
senting a de Vaucouleurs luminosity profile of a galactic 
bulge/elliptical galaxy (we will call them 'spheroids'), re- 
spectively. Profiles having Sersic indices between these 
two values of 1 and 4 are not presented here because 
n = 1 and n = 4 simulations span the range of observed 
behavior, exponential profiles being the 'easiest' to fit, 
de Vaucouleurs being the 'hardest'. Nonetheless, exten- 
sive simulations of intermediate profiles (200,000 objects) 



see http:/ /www. mpia.de/GEMS/gems. htm 
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have been produced, the catalog of Galfit fitting results 
for this sample can also be downloaded from the Gems 
webpage. 

This section is arranged as follows. The simulation of 
individual galaxies is described in §2.11 Section 12.21 de- 
scribes the construction of simulated Gems frames from 
the individual galaxy simulations, including the addition 
of realistic noise. 

2.1. Simulation of individual noise-free galaxies & 
Oversampling 

Galaxies were simulated using a custom-built IDL rou- 
tine. Most available standard routines (like MKOBJECT in 
IRAF, create/image in MIDAS and similar tasks in other 
programs) compute the correct flux value for the center 
of the pixel, but due to curvature of the profile, taking 
this as the mean flux value for the whole pixel is incor- 
rect. The higher the curvature is (within a certain pixel), 
the more one underestimates the true pixel value. This 
implies progressively larger inaccuracies for higher Sersic 
indices. 

While it is possible to analytically integrate the profile 
across a pixel to obtain an exact answer, this procedure 
is very CPU-intensive. We adopt a hybrid approach. 
We use IDL's dist_ellipse routine in conjunction with 
equation fl} to compute Sersic galaxy models which, as 
in the above cases, are only correct for the center of the 
pixel. In order to increase accuracy, the inner parts of 
our simulated profiles (100x100 pixels up to 200x200 pix- 
els depending on object size) have been oversampled by 
a factor of 10, and the very inner parts (10x10 up to 
20x20 pixels) are oversampled by a factor of 100. This 
was done by creating the images by a factor of 10 (or 
100 respectively) bigger and then rebinning the image 
while holding the total flux constant. In this way, it is 
possible to create a final profile accurate to better than 
0.03% at all radii (much smaller than the poisson noise 
added later in the simulation process) with a factor of 
100 gain in speed compared to the analytical integration 
- an important gain when simulating large samples of 
galaxies. 

2.2. Simulation of crowded images 

To realistically test galaxy extraction and fitting codes 
requires the creation of images with large numbers of 
simulated galaxies distributed as in real data. Such im- 
ages were created by providing a catalog of simulated 
galaxy input parameters to the simulation code, which 
simulated galaxies at the location, luminosity, size, ori- 
entation and axis ratio b/a specified in this catalog. In 
this step, galaxies were put in an empty image of the 
same size as the final image. 

To choose the range of galaxy parameters for the sim- 
ulated galaxies, we first fitted all Gems galaxies with 
Galfit and determined the parameter range covered by 
the real galaxy sample. Given these results, we chose a 
wider range of parameter space for the simulations, in 
order to test detection efficiency, completeness, and to 
allow pushing both parametric fitting codes to their lim- 
its. The simulations have a random distribution in size 
between 2 and 316 pixels (uniformly distributed in log- 
arithmic space) and magnitude between 20 and 27 (uni- 
form). With this distribution of parameters, there were 



a relatively large number of large and low surface bright- 
ness galaxies (stringently testing the detection efficiency 
and fitting codes); we discuss this point in more detail in 
§4.21 The exact distributions of simulation parameters 
are given in Tables Q] and [TJ 

After simulating the galaxy profiles and putting them 
in an empty image, this final image was convolved with a 
real F850LP-band P SF derived from the Gems dataset 
(jJahnke et al J 120041) . Next, an appropriate amount of 
noise had to be added to the images. Owing to the 
multiple-frame dither characteristic of HST imaging sur- 
veys, the noise is somewhat correlated pixel-to-pixel. 
Thus, strictly speaking, galaxies should be simulated in 
individual dithers, then dithered together using exactly 
the same routines as were used to combine the Gems 
frames. We took an intermediate approach: Poisson 
noise with the same RMS as the Gems noise was added 
to the simulated galaxy frame, then a real 'sky' frame was 
added to the simulated frame to accurately account for 
real fluctuations and correlated noise in observed HST 
sky backgrounds. We have confirmed through tests with 
Galfit that this (much less CPU- and work-intensive) 
hybrid approach yields a scatter which is negligible com- 
pared to random fitting uncertainties. The 'sky' frame 
was constructed by adding Gems F606W- and F850LP- 
band frames (to increase image depth and to make sure 
that objects appear in neither of the two bands) and visu- 
ally checking those images to identify patches of 500x500 
pixels in size without objects detected by SExtractor. 
The chosen patches were cut from the F850LP-band im- 
ages and pasted together to form an empty image of the 
same size as an original Gems image. 

The result of the simulation process was a simulated 
image with noise properties very similar to a real Gems 
image that contained 800 simulated light profiles with 
different magnitudes, sizes, position angles and values 
of b/a. Different sets of simulations were created in this 
way: one set contains disk-like n = 1 galaxies only (for re- 
sults see §4.1.ip and the other contains spheroidal n = 4 
profiles only (see §4.1.2p . Not all 800 galaxies were recov- 
ered by SExtractor. Roughly 80% of the objects were 
recovered, depending on the simulated profile shape and 
the distribution of galaxy parameters in the particular 
image (see Figure [2]) . In particular, very large and low 
surfac e-brightness gal axies were not detected (see Figure 
[2]and iRix et aI1l2004f h Due to the fact that spheroids are 
easier to detect due to their centrally concentrated light 
profiles, less galaxies were recovered in the disk sample. 

By design, these simulated tiles are artificial in two 
ways. Firstly, the galaxy input parameters span a wider 
range in parameter space than real galaxies. Secondly, 
the simulated tiles are significantly more crowded than 
the actual data itself, about a factor of 7-8 overdense in 
galaxies with Sersic index n > 2.5 compared to a typ- 
ical ACS image from Gems (see Figure[3]). They con- 
tain many more LSB galaxies (detected and undetected), 
adding a complex layer of extra flux to the background. 
This makes the simulations more difficult to analyze than 
real data; this was intentional since we wanted to push 
both codes to their limits. In a third set of simulations 
we mixed the two types of profile (see §4.2|) to estimate 
the effects of deblending given a more realistic mix of 
n = 1 and n — 4 galaxies. To test and compare the 
two different 2D-fitting routines, the simulated images 
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TABLE 1 

Simulation parameters, disk (n=l) galaxies 



Parameter 


Min 


Max 


Distribution 


mag [mag] 


20 


26.5 


uniform 


r e [pixel] 


2 


316 


uniform in logarithmic space 








r e < 1 o 7 - 36- °- 233 ' ma 9 , mag being chosen magnitude for object 


b/a 


0.18 


1 


uniform in cos(i), i being inclination angle 






corrected for intrinsic thickness: 








b/a = y/cos 2 (i) + (sin(i) ■ 0.18) 2 








intrinsic thickness 0.18 following Pizagno ct al. (2005); Rvden (2006) and others 


PA [deg to image] 





180 


uniform 


Sersic index n 


1 


1 


fixed 



TABLE 2 

Simulation parameters, spheroidal (n=4) galaxies 



Parameter Min Max Distribution 

mag [mag] 20 27 uniform 

r e [pixel] 2 630 uniform in logarithmic space 

v < ^q4.79— 0.1-mog 

r < io 11 - 49-0 - 392 '" 10 ^ 

b/a 0.45 1 uniform in cos(i), i being inclination angle 

corrected for intrinsic thickness: 
b/a = Vcos 2 (i) + (sin(i) ■ 0.45) 2 

PA [deg to image] 180 uniform 

Sersic index n 4 4 fixed 



were treated as 'real' images, i.e. we used exactly the 
same data pipeline for fitting that was used for the real 
Gems data analysis. Therefore, all effects which we can 
see in the results from simulations should be present in 
real data as well, although mixed with many other ef- 
fects like bulge/disk composite profiles, non-smoothness, 
lumpincss and/or spiral features of real galaxies. 

3. GALAXY FITTING: DESCRIPTION, BASIC 
CONSIDERATIONS, BEST-FITTING SETUPS 

For the Gems analysis, we have used two widely- 
employed parametric fitting codes for quantitatively de- 
scribing galaxy structure and morphology: Galfit and 
Gim2d. In this section, we describe both codes and 
the procedures used to parametrically fit both the real 
Gems data and the simulations described in the previ- 
ous section. The basic considerations for code setup and 
application to real data, and the tests which we have 
performed on simulated data, arc useful in general to 
other workers in galaxy image fitting. These basic con- 
siderations for setup and application of these (and most) 
fitting codes are (1) sky estimation, (2) initial parame- 
ter guesses, (3) postage stamp construction, and (4) de- 
blending and/or masking of neighboring sources. We de- 
scribe in detail the setups and various tests we carried 
out in order to optimize these setups in ^3. II for Galfit 
and g3j for Gim2d. 

The initial conditions and setups for both Galfit and 
Gim2d are determined using SEXTRACTOR output im- 
a ges and catalogues. We use SExtractor (version 2.2.2 
. iBertin fc Arnoutd [l996h for image parsing and cata- 
log creation. SExtractor detects, deblends, measures 
and classifies objects, giving estimates of magnitude, 
size, b/a, position angle and a star-galaxy classification. 



In Gems, we found that no single SExtractor setup 
satisfactorily detected and deblended both bright, well- 
resolved galaxies and faint galaxies near the detection 
limit. Accordingly, our best setup is to run SExtrac- 
tor twice: once to detect the bright objects without 
splitting them up (what we call the 'cold' version) and 
once to detect the faint objects ('hot' version). The two 
versions are then combined to give one single catalog con- 
taining all objects . The proc edure is described in more 
detail in lRix et all (|2004h and lCaldwell et all (|2006t ). We 
do not use the SExtractor output catalogs directly for 
science; instead, these values are used as initial estimates 
for galaxy fitting codes and their setup. In the following 
sections we will describe which parameters are taken as 
starting guesses and how these values are used for the 
two parametric galaxy fitting codes used in this work: 
Galfit and Gim2d. 

3.1. GALFIT 

Galfit is a 2 D gal axy fitting software package written 
bv iPeng et "all (|2002ft . We used Galfit Version 2.0.3b 
from Feb. 2, 2005 for this analysis. Galfit was designed 
to extract structural components from galaxy images. 
Compared to other fitting techniques it has two main 
advantag es. It uses a Levenberg-Marquardt downhill- 
gradient (|Pressin"997f) method to derive the best fit and 
therefore is relatively fast, being able to fit roughly 3000 
galaxies per day on a dual 2.4 GHz LINUX processor 
(when running 4 threads simultaneously to efficiently use 
all CPU time). Furthermore, due to its speed and design, 
it is able to fit an image containing an arbitrary num- 
ber of galaxies simultaneously, making it possible to fit 
neighboring objects. The main disadvantage of Galfit, 
in theory, is that it is possible that it converges on fit 
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Fig. 2. — F850LP magnitudes and sizes for the full set (all symbols) of 1600 simulated n=l galaxies (left) and n=4 galaxies (right), 
and the subsets that were detected by S EXTRACTOR (small grey crosses). The black squares indicate galaxies th at were misse d during 
object d etecti on. The in dicated contours show the magnitude-siz e spac e populated by actual Gems galaxies used by Bardcn et al. (for disk 
galaxies 2005, left) and Mcintosh ct al. (for spheroidal galaxies 2005, right); the contours show the areas of parameter space where the 
reliability of the fitting routines becomes especially important. Whereas real n > 2.5 galaxies lie in the area where all galaxies are detected, 
we did use n < 2.5 galaxies that are close to the edge of detectability for our analysis. The different behavior of the non-detected galaxies 
in both samples reflects the fact that, due to their bright central peak, galaxies with a high Sersic index are easier to detect than galaxies 
with low n. To guide the eye, we overplot long-dashed lines of constant surface brightness of 17, 19, 21, 23 and 25 magnitudes arcsec -2 
from bottom to top. 



solutions that represent a local minimum instead of giv- 
ing the global minimum. Our experience with Galfit 
is that in single component, but multi-object, fits this 
happens relatively rarely, if at all, both through the sim- 
ulations ( §4.1|) and through comparison of fitting results 
for real galaxies from Galfit and Gim2d ( §4.3|) . 

During the fitting process, the model is convolved with 
a specified PSF to model the image seeing and then com- 
pared to the input image. It is possible to fit the back- 
ground sky level during the fitting process, although in 
this paper we use this capability for testing purposes only 
(see pX2|) . 

In the following section, we will explain the basic setup 
procedure of Galfit in detail; e.g., cutting postage 
stamps, estimating the sky background, deciding on how 
galaxies should be deblended, and setting up the initial 
parameters for Galfit. We developed automated rou- 
tines for this purpose, and we describe their most im- 
portant features in this section. As sky background is of 
critical importance, we discuss this issue in some detail 

in gsm 

3.1.1. Galfit setup and Galapagos 

Galfit is designed to fit one galaxy of interest at 
a time. Therefore, we created an individual postage 
stamp for each galaxy of interest . These postage 
stamps were created, and initial Galfit parameter files 
produced, by an IDL program, GALAPAGOS (Galaxy 
.Analysis over Large Areas: Parameter Assessment by 
GALFlTting Objects from SExtractor, for further de- 
tails about Galapagos and details of the procedure see 
Barden et al., in prep.). For every object in the SEx- 
tractor catalog Galapagos did the following. 



1. First, GALAPAGOS determined the size of the re- 
quired postage stamp for each object. This was 
done using different object sizes and angles given 
by SExtractor: 

Xsize = 2.5*a*kron*(\ sin(6*)| + (l — ellip)*\ cos(#)|) (2) 

Y size = 2.5*a*kron*(\ cos(#)| + (l — ellip)*\ sin(#)|) (3) 

where a is the SExtractor output parameter 
A_IMAGE, kron is KR0N_RADIUS, 9 is THETA_ IMAGE 
and ellip is ELLIPTICITY. Extensive testing 
showed that this algorithm for producing postage 
stamps was a good compromise between the con- 
flicting needs of having enough sky pixels present in 
the postage stamp to give a robust fit of the object, 
while keeping the postage stamps small enough to 
be fit in reasonable amounts of CPU time. 

2. In the next step, Galapagos decided from this 
postage stamp and the aperture map, which sec- 
ondary objects had to be deblended and fitted 
simultaneously and which objects were simply 
masked out during the fitting process. For this it 
created a second map where SExtractor aper- 
ture ellipses were increased in linear size by a fac- 
tor of 1.5 (a factor of 2.25 larger area). Every ob- 
ject whose ellipse overlapped with the ellipse of the 
primary object was fitted simultaneously using a 
single Sersic profile; every other object with pix- 
els in the postage stamp was masked out during 
the fit, using this expanded ellipses as the mask 14 . 

14 In many other fitting routines the SEXTRACTOR segmenta- 
tion map is used for masking; our masks are considerably more 
conservative. 
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Fig. 3. — Density of sources in simulation images. Left: One of the two simulated images with galaxies having Sersic index of 4. We plot 
a circle with radius r e (simulated) at the correct x,y location of each simulated galaxy. In total there are 800 simulated n = 4 galaxies 
placed in a single ACS tile, ~ 250 of which are too LSB to be detected by S EXTRACTOR; they could nonetheless influence the fitting results 
by contributing to the image background. This simulation represents an extreme case for testing the limits of profile fitting with Gim2d 
and Galfit. Right: the sources in a simulation of a typical Gems ACS image (Gems tile 04) using Galfit fitting results. There are 523 
total simulated galaxies, 374 with n=l (in dark grey) and 148 with n=4 (light grey). Stars and the few objects (in total 46 objects) that 
ran into a fitting constraint where excluded from this simu lation . On e can e asily see from this plot that real galaxies are significantly less 
crowded than the completely artificial simulations used in £|4.1, H and £14.1,21 



This way time-consuming fits, with 10 or more ob- 
jects to be simultaneously fitted, were avoided in 
most cases. In total for around 48%/31%/46% of 
the fits, at least one secondary object had to be 
taken into account (for n = 1 simulations, n = 4 
simulations and real galaxies respectively). In the 
most crowded situations we find that we needed to 
simultaneously fit a maximum of 9/7/12 profiles. 



jects (between 15 and 30 r e ) and an ellipse of width 
of 180 pixels at around 4-6 r e for the bigger objects 
(for details see Barden et al, 2006, in prep.). We 
call this sky estimate the 'isophotal sky' in what fol- 
lows, and testing shows that for fitting with Gal- 
fit the 'isophotal sky' provides significantly bet- 
ter fitting results than using sky values from, e.g., 
SExtractor (see 33X2]) . 



3. After this step the sky background was estimated. 
For this, GALAPAGOS used the aperture map on 
the whole science frame (and not only the postage 
stamp) and estimated the mean value of all pix- 
els that lay within 6 consecutive elliptical annuli, 
each with a width of 60 pixels (measured along the 
semi- major axis; corresponding to 1.8" using the 
Gems data with 0.03" /pixel). These 6 annuli par- 
tially overlap, with a spacing of 30 pixels between 
successive annuli. The annuli were centered on the 
primary fitting galaxy (pixels belonging to a sec- 
ondary object were ignored in this step). The in- 
nermost area is masked out during this process (the 
factor of 1.5 magnified aperture ellipse enlarged 
by a further 30 pixels). These annuli 'marched 
outward' together in radius in steps of 30 pixels 
until the gradient of the mean values within the 
last 6 rings (180 pixels) was larger than -0.05; the 
change in the sky value, given that the mean Gems 
F850LP sky background is around 18 counts, was 
then well below 0.3 % within this radial range. The 
sky was then determined as the mean value of the 
outermost 6 annuli. This made the area where the 
sky is determined to be an ellipse between 35 and 
215 pixels in semi- major axis for the smallest ob- 



4. In the same step, by dividing the elliptical indi- 
vidual annuli into 8 octants, Galapagos was able 
to detect sky gradients within an annulus as a 
function of position angle. Such cases were rela- 
tively rare, and were due to nearby bright objects 
that did not reach into the postage stamp them- 
selves (especially bright high Sersic index objects 
with strong wings). Galapagos then identified 
these objects in the SExtractor catalog auto- 
matically and these objects were fitted simultane- 
ously to eliminate this sky gradient (Galfit can 
fit profiles that are centered outside of the postage 
stamp). In the very rare cases that an identifi- 
cation was not possible although a strong gradi- 
ent was present (i.e. the object lay outside of the 
original Gems tile), we fit an artificial object cen- 
tered outside the postage stamp in the correct di- 
rection to achieve the same result. In total, 15.2% 
of the fits in the simulated disk sample needed an 
additional identified profile centered outside of the 
postage stamp, 1.5% needed an artificial, not iden- 
tified profile (4.3%, 0.6% for simulated spheroidal 
galaxies). For real galaxies only 3% of the fits 
needed an identified object, 0.4% needed an ar- 
tificial profile. Recall that the simulated images 
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contained a large number of galaxies not recovered 
by SExtractor; these galaxies contributed to the 
background sky only. These galaxies can lead to 
'sky' gradients found by GALAPAGOS. This effect 
should be, and is, more pronounced in the sample 
where fewer galaxies are recovered. 

5. The last step for setting up Galfit was the deter- 
mination of the starting guesses for the different 
fitting parameters from SEXTRACTOR and writ- 
ing them to a Galfit start file automatically (see 
Table [I]). We decided to fit single Sersic pro- 
files to all galaxies (with a starting value of 1.5 
for the Sersic index). Starting magnitudes were 
given by SExtractor MAG_BEST, sizes were given 
using FLUX_RADIUS (we used the formula r e — 
0.162 • R}fJ x , where R f i ux is FLUX_RADIUS. This 
formula was determined empirically using simula- 
tions). The axis ratio b/a was derived by taking 
the SExtractor ELLIPTICITY, the position an- 
gle by THETA_IMAGE. Furthermore the position of 
the objects within its postage stamp was required, 
which was directly given by the cutting process of 
the postage stamps (the object is centered within 
its postage stamp, see step 1). The parameter disk- 
iness/boxiness in Galfit was fixed to (no box- 
iness/diskiness) for all our fits. Furthermore, as 
described above, the estimated sky value from step 
3 was held fixed during the fit. Each object that 
had to be deblended during the fitting process was 
included (from step 2) with its appropriate starting 
values; all other objects were masked out (using a 
mask image with the by a factor of 1.5 enlarged 
SExtractor apertures that tells Galfit which 
pixels it should use and which pixels it should ig- 
nore during the fit). Finally, the Gems PSF (see 
Jahnke et al., in prep.) was provided to Galfit. 

We adopted a set of fitting constraints for Galfit 
which prevented the code from exploring unphysical (and 
time-consuming) areas of parameter space. We used 
0.2 < n < 8, 0.3 < r e < 500 [pixels of 0.03" in size] 
and fixed the fit magnitude to be within 5 mag of the 
SExtractor MAG_BEST. 

We used exactly the same constraints for real galaxies, 
also using the same setup procedure. Whenever we state 
that Galfit fitted 'successfully', we mean that Galfit 
returned a result (it did not crash during the fit) and the 
fit did not run into any of the constraints given above. 

3.1.2. Galfit sky test 

The estimate of the sky background is of critical im- 
portance in determining parametric or non-parametric 
descript i ons o f galaxy surface brightness profiles (e.g., 
Ide Jongl [199 6). While in principle it is possible to fit 
the sky level as an extra parameter, such a procedure 
requires that the surface brightness profile being used is 
an accurate description of the real galaxy light profiles. 
An alternative is to estimate the sky level as carefully as 
possible prior to the fit and hold it fixed. In this section 
we quantify the effect of different assumptions/estimates 
of the sky level for Galfit (the results of the equivalent 
test for Gim2d are shown in £|3.2.3[) . We test three se- 
tups: i) the isophotal sky, ii) the sky value determined 



by SExtractor, and Hi) allowing sky to be a free pa- 
rameter, to be estimated by Galfit. 

In Figure 2] one can see the difference between the 
sky values derived by the two sky estimation methods, 
Galapagos and SExtractor, for the two different 
samples of simulated n=l and n=4 galaxies, respectively. 
Because the simulations were added to a sky frame com- 
posed of empty patches of real sky, the true sky values 
were known to be 18.14 ± 0.03, indicated by the verti- 
cal dashed line in both plots. SExtractor recovers a 
mean value of 18.29 (cr=0.10) for disk galaxies and 18.40 
(er=0.11) for spheroidal galaxies. The isophotal estima- 
tor in Galapagos gives a mean value of 18.13 (<r=0.10) 
for disk galaxies and 18.26 (<7=0.11) for spheroidal galax- 
ies. Although all distributions have around the same 
width, one can see that both methods recover the sky 
better for the low Sersic-index sample. Furthermore, in 
both samples, the isophotal estimator gives back rather 
more accurate sky values. 

That SExtractor recovers a sky value that is slightly 
too high has been noted before - e.g., by the Goods 
team 15 and was the reason why we decided to write our 
own isophotal sky estimator. 

That the sky is easier to estimate for the n = 1 sim- 
ulations than for the n = 4 simulations can be partly 
explained by our simulation of a number of large, low sur- 
face brightness galaxies which escape detection by SEx- 
tractor and which inflate the sky surface brightness. 
Since galaxies with high Sersic index n have more ex- 
tended wings the effect of contamination in the outskirts 
is larger for n — 4 simulations than for n = 1 . There is a 
further effect for n — 4 galaxies: since the sky estimates 
provided by SExtractor and Galapagos only probe 
out to < 6r e for brighter galaxies, there is a residual con- 
tribution to the sky from the galaxy itself which becomes 
more serious as n increases. 

We compare the fitting results with the three different 
sky setups in Figure We only show results for the sam- 
ple of simulated n — 4 galaxies; the results for the n = 1 
galaxies were qualitatively similar but the systematic ef- 
fects are much weaker, showing very little difference be- 
tween the three different sky setups. The Y-axis shows 
the deviation of the three key parameters magnitude, r e 
and n from their true values, and the X-axis shows the 
simulated mean surface brightness fii np ut of the galaxies 
within an ellipse with semi-major axis r e and the axis 
ratio b/a: 

[i = mag + 2.5 • log(2 • b/a ■ tt ■ r 2 ) (5) 

where mag is the magnitude, b/a the axis ratio and r e the 
half-light radius of the object in arcsec. The factor of two 
accounts for the fact that only half the light is within the 
half-light radius. The top axis shows the mean S/N per 
pixel corresponding to that average surface brightness /it, 
given by 

S/N = (g) ■ [(g) + (g sky ) + a^ 1 ' 2 (6) 

where (g) is the average countrate [in e~] for galaxy pix- 
els within r e , (g s ky) is the background flux [in e~] within 
a pixel, and cr s ky is the uncertainty of the background 
sky estimate, obtained from the empty sky image. 

15 see http:/ /www. stsci.edu/science/goods/catalogs/rl.Oz_readme/ 
chapter 5.1 Local sky background 
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TABLE 3 

Starting guesses for Galfit when using Galapagos 



Parameter 


Starting guess 


from SEXTRACTOR 


mag 


MAG_BEST 




r e 


0.162 ■ FLUX.RADIUS 187 




b/a 


1 - ELLIPTICITY 




PA 


THETA IMAGE 




n 


1.5 




x ,y 


the postage stamp is centered 


on the primary object 




positions of secondary objects 


can be derived from SEXTRACTOR 



in 

o 




n=1 galaxies 



<x> 

w 
o 



17.8 18.0 18.2 18.4 18.6 
estimated sky 




17.8 18.0 18.2 18.4 18.6 
estimated sky 



Fig. 4. — This plot shows the recovered sky values for both used estimators, SEXTRACTOR (grey) and isophotal (determined within 
GALAPAGOS, black) for each of the simulated samples. The vertical dashed line indicates the true value. One can clearly see that both 
methods tended to overestimate the sky value for the spheroid sample, mainly due to the large wings of galaxies in this sample contributing 
to the sky level. 



At faint surface brightness levels, one can see that 
magnitudes are typically overestimated (i.e., are recov- 
ered too faint), sizes are systematically underestimated, 
and Sersic indices are typically underestimated (Figure 
[5]). The effects are subtle and affect only galaxies much 
fainter than the sky surface brightness for the isophotal 
sky and the sky fit by Galfit. These effects set in at 
much higher surface brightness (approximately 2 magni- 
tudes arcsec -2 above the sky level) for SExtractor- 
derived sky values. 

Following these test results (Tables 0] and [5]) , and the 
general concern that galaxies may deviate from paramet- 
ric descriptions of their light profiles in their outer parts, 
we choose to use the isophotal sky estimate for Galfit 
anal ysis for this paper (and those used in other papers, 
e.g.. iBarden et al. 2005). Should one not have access to 
accurate sky values from GALAPAGOS or a similar rou- 
tine, the test results show also that allowing Galfit to 
estimate the sky levels for single Sersic profiles is an ac- 
ceptable alternative, provided that the surface brightness 
profiles of the galaxies of interest are well-approximated 
by a Sersic profile over a wide range of radii. 

3.2. GIM2D 



GlM2p ( Ga l axy /m age 2D ) was written by Luc Simard 
(|Simardl fl998. 2002) as an IRAF package for the quanti- 
tative morphological analysis of galaxies. We use version 
3.f for the analysis in this paper. For a single Sersic fit 
we work in 7-dimensions with the bulge fraction param- 
eter set to B/T = 1; thus, we find the best-fit model 
described by /tot, ?"c, e, 4>b, dx, dy, and n. During the fit, 
the images are deconvolved with a given PSF. Gim2d 
uses the Metropolis algorithm to find a % 2 minimum, 
which makes it less prone to settle on local minima. On 
the other hand, this algorithm is time consuming. Ac- 
cordingly, to process large datasets, Gim2d ought to be 
run on many machines in parallel. 

3.2.1. GIM2D setup 

As with Galfit, Gim2d requires certain generic con- 
siderations for galaxy profile fitting: (1) postage stamp 
construction, (2) nearby companion masking, (3) back- 
ground sky estimation, and (4) initial parameter guesses. 
We did not use Galapagos to set up Gim2d's galaxy fit 
for two reasons: Gim2d is embedded into IRAF whereas 
GALAPAGOS requires IDL; and the simultaneous fitting 
of galaxies is not supported in Gim2d, whereas much 
of Galapagos's algorithm is devoted to making deci- 
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Fig . 5. — This plot shows the fitting results of the spheroid galaxy sample when fitted with Galfit using three different sky estimates 
(see i|3. 1.2P : isophotal sky from GALAPAGOS (left), SExTRACTOR sky (middle) and the sky returned by Galfit when allowed to fit it as 
a free parameter. The X-axis shows the simulated mean surface brightness within r e defined by /i = mag + 2.5 ■ log(2 ■ b/a ■ n ■ r^) , 
where mag is the magnitude, b/a the axis ratio and r e the half-light radius of the object in arcsec. The thin vertical grey line in the plot 
indicates the brightness of the sky background. The upper X-axis shows the mean signal-to-noise ratio per pixel within r e calculated by: 
S/N = (g) ■ [(g) + {g s ky) + a sky\~ 1 ' 2 j where (g) is the average counts in a galaxy pixels within r e (basically /x), (g B k y ) is the background 
flux within a pixel and a s k y is the uncertainty of the background sky estimation. Although this number is only a rough approximation, it 
gives a feeling about the mean S/N of the galaxies. The Y-axis shows magnitude difference (fitted - simulated), size ratio (fitted/simulated) 
and Sersic index fitting results. Perfect parameter recovery is indicated as the horizontal thin dark-grey line. The thick light-grey line and 
the thick dashed dark-grey line indicate the mean value and lcr values for different surface brightness bins. The small crosses show the 
galaxies that where fitted 'successfully', meaning that the fit returned a result and that it did not run into fitting constraints. As one can 
clearly see getting a good estimation of the sky level is important. Both using the isophotal sky estimation (leftmost column) and using 
the sky level as a free parameter during the fit (rightmost column) return more reliably results than using SExTRACTOR sky estimations. 



sions about which galaxies are to be simultaneously fit. 
Therefore, Gim2d is set up by using a different proce- 
dure, which we describe in this section. 

Starting with the combined hot/cold SEXTRACTOR 
output catalogues, a square postage stamp was cut from 
the large image, centered on each galaxy with size given 
by 4ai so x 4<2i SO , where aj so is the major axis diameter of 
the SExTRACTOR isophotal area in pixels (the minimum 
postage stamp size we allowed was 101 x 101 pixels). 
Gim2d masks out nearby objects using SEXTRACTOR 
segmentation maps: discussion of the consequences of 
this procedure is presented later in this section, and in 
£ 14.1.21 For sky estimation and defining the best part of 
the fitting parameter space to explore, Gim2d has sev- 
eral important setup parameters that allow the user to 
modify its behavior. In this section, we describe some 
of the most important ones - parameters that we find to 
critically affect the performance of the code. 

The parameter 'dobkg' specifies whether Gim2d de- 



termines the background itself ( 1 dobkg ' =yes) or fixes 
the sky to a user-defined value (' dobkg '=no). With 
' dobkg' =yes, Gim2d calculates the background prior to 
galaxy fitting directly from the postage stamp images of 
each source using only non-object (sky) pixels as speci- 
fied by the SEXTRACTOR segmentation map. As such, 
this method is closely dependent upon extracting a large 
enough image to get a reliable sky measurement. Once 
determined, the sky value is held fixed during the fitting. 
If 'dobkg '=no, Gim2d assumes that the postage stamps 
have background equal to zero; therefore, the user may 
use an external method to estimate the sky and subtract 
this from the input images. Gim2d does offer an op- 
tion to fit the background offset (parameter db) as a free 
parameter during fitting, but this is not recommended 
when working with real galaxies with non-idealized pro- 
files. We test the effect of different sky estimates in detail 
in gffll 

Gim2d, like Galfit, has constraints which can be ap- 
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TABLE 4 

Gim2d: Fitting of n = 4 Simulations using Galfit: Bright subsample with < 22.5 and 

mag in < 22.5 



Sky used 


Sersic 


n 


rso 


ratio 


Amag 


e ratio 


APA 


Quality 




mean 


17 


mean 


a 


mean a 


mean a 


mean a 




isophotal sky 
SExtractor sky 
Galfit sky 


3.99 
3.79 
3.94 


0.27 
0.29 
0.24 


1.00 

0.96 
0.99 


0.05 
0.07 
0.06 


0.00 0.03 
0.03 0.04 
0.01 0.03 


0.99 0.04 
1.00 0.04 
1.00 0.04 


0.17 1.54 
0.17 1.56 
0.16 1.55 


0.06 
4.66 
0.38 



Note. — This Tabic summarizes the results from using different sky estimators with Galfit for bright 
galaxies; sec Table [5] for results for faint galaxies. 

The columns give deviations (resistant mean values clipped at 3cr) from the simulated value and scatter for 
the 5 key fitting parameters. The a values given are values computed iterativcly for all galaxies within 3 a. 
The last column gives the fit quality. This number is defined as: 



Quality = 1000 * [(An/4 - if + (Are - if + (Amagf + (A(b/a) - if + (APA/180) 2 )] (4) 

where A values are given as the mean values in the table. This quantity is a fairly intuitive combination of the 
different fit parameters, indicating in broad terms which setups perform well (low values) and which perform 
poorly (high values). One can see that indeed using the isophotal sky as given by Galapagos and using the 
sky level as a free parameter during the fit return much more reliable results than the SExtractor sky already 
for these bright galaxies. Using the isophotal sky seems to be the ideal setup. 



TABLE 5 

Gim2d: Fitting of n = 4 Simulations using Galfit: Faint subsample with 23.5 < n ln < 26.0 



Sky used Sersic n rso ratio Amag e ratio APA Quality 

mean tr mean tr mean a mean a mean a 



isophotal sky 3.95 1.13 0.97 0.44 0.02 0.42 1.04 0.18 0.20 6.31 2.92 

SExtractor sky 3.05 0.98 0.61 0.27 0.41 0.39 1.05 0.18 -0.06 6.38 375.33 
Galfit sky 3.78 1.16 0.94 0.47 0.09 0.44 1.04 0.19 0.07 6.12 16.93 



Note. — Same as Tabled but for faint galaxies. As one can clearly see from this table and as was expected, 
fitting faint galaxies is more difficult. Using the isophotal sky during the fit returns the best fitting results, 
slightly better than the internal estimation in Galfit. Using the SExtractor sky returns significantly worse 
results. 



plied to limit the regions of parameter space searched for 
solutions. Gim2d starts with a user-specified parameter 
space, given by the initial value and minimum/maximum 
hard limits for each parameter to be fit. Gim2d has an 
option to automatically narrow the focus of the input pa- 
rameter space by setting ' initparam' =yes. With this 
setup option Gim2d uses FOCAS-like image moments 
based on information extracted from the SExtractor- 
created segmentation map to estimate the hard limits for 
the model parameter space. 

Under all setups Gim2d starts in the Initial Condition 
Finder (ICF) mode, which explores the user-specified 
parameter space coarsely to find the best initial model 
guess. In practice, the ICF creates A/icf models through- 
out the allowed parameter space, selects the best one, 
and then reduces the search volume by a factor equal 
to iVicF- The final result from the ICF is used as the 
starting point by the Metropolis algorithm. The Gim2d 
website gives a default value of A/icf = 100. 

To find the best-fitting setup we rigorously tested a 
large number of different setups of Gim2d. We do not 
discuss all of the different setups here; the most impor- 
tant ones are shown in Tables [7] (for bright galaxies) and 
[1] (for faint galaxies) and will be discussed in detail in the 
following sections starting with the recommended Gim2d 
setup ( p.2 2p . sky tests (t J3T2T3]) . other tests (i }3?2~4|) con - 
cluding with the final adopted best-fitting setup f ^3.2.5|) . 



3.2.2. Gim2d recommended setup 

In Figure in the leftmost panels, we show fit- 
ting results for the setup that is recommended on the 
Gim2d webpage 16 (setup K in Table [7j). This rec- 
ommended setup, in particular, has 'dobkg'=yes and 
' initparams ' =yes; i.e., Gim2d determines the sky level 
and fitting constraints from SExtractor output. As is 
clear from this plot, this setup produces unsatisfactory 
results even for fairly high surface brightness galaxies 
and where Gems survey completeness is still quite high. 
The systematic errors are already ~ 50% in r e near the 
sky level. Fitting results are strongly systematically bi- 
ased towards fainter magnitudes, smaller sizes and lower 
concentrations. 

As most galaxy surveys aim to push their analysis 
down to faint levels, the ideal performance of any fit- 
ting code is to provide parameter estimates that are free 
of significant systematic trends. Therefore, we deem the 
recommended setup to not be suitable for the Gems sur- 
vey. In an attempt to improve the Gim2d performance, 
we tried a number of different strategies, among them 
different settings of 1 initparams ' and 1 dobkg ' . 

Through extensive testing, we find that the best re- 
sults are obtained when both ' initparams ' =no and 

16 The GIMFIT2D descript ion is 

http://www.hia-iha.nrc-cnrc.gcca/STAFF/lsd/gim2d71 an d 
specifies the last program update ol March 19, 2001 
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Fig. 6. — This plot compares the recommended Gim2d setup to setups where we used different settings of the Gim2d parameters 
'initparam' and 'dobkg' [from left to right: setup K (initparams=+, dobkg=+, recommended), J (-, +),C (+, -)and A (-, -; best), see Table 
[7J. The X-axis shows the simulated surface brightness of the galaxies. The Y-axes again show magnitude difference, size ratio and Sersic 
index fitting results. The thick light-grey line and the thick dashed dark-grey lin e indi cate the mean value and the la line for different 
surface brightness bins. One can see easily that our best-fitting Gim2d setup (see £|3.2.5| I fits galaxies with much less systematic bias than 
the initial setup recommended on the Gim2d webpage. This is especially true for galaxies fainter than the surface brightness of the sky. 



' dobkg '=no (see rightmost panels in Figure [6]), and 
when the SExtractor local background is used (see 
i j3.2.3[) . Setting ' initparams ' =no and dobkg=yes pro- 
duces very modest improvement. Setting 'dobkg' =no 
and initparam=yes helps considerably, giving satisfac- 
tory results for galaxies with surface brightness higher 
than the sky surface brightness 17 . As can be seen in Fig. 
[6l using a fixed background (dobkg '=no) and setting 
initparams ' =no removes the strong systematic trend 
towards poorer fits for low surface brightness galaxies, 
albeit with large scatter at the faint end. As explained 
in §3.2.1[ the initparams ' =no option allows Gim2d to 
explore the full range of parameter space when deter- 
mining the best-fit solution. In contrast, setting this op- 
tion to 'y es ' narrowly constrains the magnitude and r e 
for objects where the SExtractor segmentation map 
severely misses the total extent of galaxies below the sky 
brightness. It is worth noting that compared to these two 

17 This is the setup that was used by [Mcintosh ct al. (2005) 
for their study of the evolution of the early-type n > 2.5 galaxy 
luminosity— size and stellar mass— size relations. Their sample of 
n > 2.5 galaxies all had F850LP surface brightness brighter than 
22.5 magarcsec -2 , and inspection of the third row of panels in 
Fig.[6]and setup J in Table 171 shows that at these limits the Gim2d 
fitting results suffer from < 10% biases. 



TABLE 6 
Parameter limits used 

FOR GlM2D WHEN USING 
INITPARAM— NO 



Parameter 


min 


max 


mag 


20 


27 


r e 


0.3 


500 


cllipticity 


0.0 


1.0 


PA 


all 


all 


n 


0.2 


8.0 


centering x 





3.0 


centering y 





3.0 



parameter choices, other effects such as the precise fixed 
sky value used (see 33.1.2|) and the image size ( fl3.2.4jl 
appear to produce only minor improvements. The 
minimum and maximum limits of the parameter space 
that we allowed in our best setup are manually set to 
span more than the entire range of the simulations in 
terms of size, luminosity and Sersic index, more than the 
physically useful parameter range of real galaxies, so that 
the solutions are not 'pinned up' against the boundary 
values artificially imposed on them (the actual values are 
given in Table [J) . Fits that ran into any of the given fit- 
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ting constraints were removed from the sample for the 
analysis in this paper. 

From our findings, we strongly recommend that Gim2d 
users avoid the dobkg=yes option and be cautious of 
the surface brightness effects that arise when using init- 
param=yes. 

3.2.3. Gim2d sky test 

In the above, we showed that sky value estimation can 
dramatically affect Gim2d fits using the default (rec- 
ommended) setting. In this section, we repeat the sky 
analysis for Gim2d as carried out in H3.1.2I for Galfit. 
We tested Gim2d using SExtractor local sky (setups 
A, B and C in Table [7J and the isophotal sky that we 
used for Galfit (setups D, E, F, G and H). We carried 
out one test - setup I - where we used the 'real' back- 
ground of 18.14 counts, determined on the 'sky' image 
used in the simulation process. Such a setup is of aca- 
demic interest only, as for real galaxies it is impossible 
to measure such a sky value. Nonetheless, this test gives 
insight into the performance of Gim2d when the actual, 
known, sky value is used as an input for galaxy fitting. 
For setup J and K we tried fixing the sky background to 
the value determined directly by Gim2d ('dobkg'=yes). 
We show results from these different tests in Figure [7] It 
is interesting that Gim2d performs somewhat better us- 
ing SExtractor local sky, whereas we showed in t|3.1.2l 
that Galfit performs somewhat better using the isopho- 
tal sky from Galapagos. It is likely that the cause of 
this behavior is related to how Galfit and Gim2d deal 
with nearby neighbors. Since Galfit simultaneously fits 
neighboring galaxies which overlap with the galaxy of in- 
terest, the isophotal sky estimate better represents the 
background pedestal that is common to the neighboring 
sources. Gim2d, on the other hand, is unable to simul- 
taneously fit neighbors, and relies on masking neighbors 
using the SExtractor segmentation map. Thus the 'ef- 
fective sky' for Gim2d includes flux from the outer parts 
of the galaxy itself and neighboring sources; as the SEx- 
tractor sky is derived from the same 'sky' area used 
for fitting, it is a more appropriate value. 

3.2.4. Gim2d other tests 

To determine the best-fitting setup to use with Gim2d, 
we performed 11 different tests (rows A-K) as shown in 
Tables [7] (bright galaxies) and [T] (faint galaxies). For 
the 'bright' galaxies, we selected all N galaxies with 
(Msim < 22.5 magnitudes arcsec -2 and mag S i m < 22.5 
(representing the sam ple of early-type galaxies from 
iMcIntosh et al.l (|2005f) . i.e., those important for surveys 
of early-type galaxy evolution) from the set of iV tot galax- 
ies in the sample for which Gim2d returned a result. The 
'faint' sample included galaxies with 23.5 < fi s i m < 26 
magnitudes arcsec -2 . We then calculated the mean of 
the recovered value or ratios of the different fit parame- 
ters and the 68% confidence interval. 

In our visual examination of the properties of the out- 
liers in these distributions, we found that most of the 
non-Poisson scatter is caused by contamination of the 
outer isophotes of the object of interest by nearby neigh- 
bors. While this issue is discussed in more detail later 
in ^4.1.31 we illustrate this behavior by running Gim2d 
on 3 different postage stamp widths of 2ai so (setup B), 



4a; so (setup A), and 6ai so (setup D). These tests find 
that there is an increase in scatter for larger image size, 
consistent with the expectation of contamination. Us- 
ing 2ai SO reduces the extreme outlier fraction somewhat 
from Aaiso- Yet, since such outliers are a small fraction 
of the objects, this change in postage stamp size had rel- 
atively little impact on the RMS scatter (see tables [7] and 
[J). The best- fit stamp cutouts we adopt here have sides 
equal to 4a; so . This seems to be the best compromise 
between a postage stamp large enough so that Gim2d in- 
cludes enough of the important outskirts of the galaxies 
for fitting, but small enough that neighboring galaxies 
are reasonably rare and CPU requirements are reason- 
able. For comparison, the stamp sizes used in Galfit 
fits are nearly always larger in area than 4ai so x 4a; so , due 
to the requirement of simultaneously fitting neighboring 
galaxies. We will further quantify the effect of neighbor- 
ing galaxies in 34.1. 21 using the set of simulated spheroids 
examined there. 

We also tested whether the initial number of ICF mod- 
els affected our fitting results. Holding all other setup 
choices constant we compared the results from fits to the 
n = 4 simulations with the default value of A^cf = 100 
(setup F in Table [7J, to results for AT ICF = 25 (setup E) 
and 400 (setup G). We found that the results are inde- 
pendent of the number of ICF models. 

3.2.5. Gim2d best-fitting setup 

As is apparent especially from Table Q] and Figure [6) 
the best combination of parameters for our simulations 
was given by setup A, using SExtractor background 
(dobkg—no), initparam—no and 4a; so as image sizes. We 
choose this setup to be our best and use it throughout this 
paper to compare Gim2d results with Galfit results. 

4. GALFIT/GIM2D COMPARISON USING 
OPTIMIZED SETUPS 

In this section, we discuss the results of testing our best 
setups of Gim2d and Galfit. Section [4~T1 describes the 
results obtained using the simulated images with artifi- 
cial distributions of galaxy parameters as explained in Sj2j 
Section 14.21 describes a very similar test using simulated 
galaxies having more realistic parameter distributions, 
as derived from real galaxies recovered from individual 
Gems survey fields. Section 14.31 sums up the results of 
tests where real images of different depths were fit and 
the results intercompared. 

4.1. Results of Fitting Simulated Galaxy Images 
4.1.1. Results of pure disk simulations 

Figure [8] shows both Galfit and Gim2d results for 
the set of simulated disk galaxies with an exponential 
n = 1 light profile. Of the 1600 galaxies simulated in this 
sample, 997 (62%) were recovered by SExtractor. Of 
these, 979 (98%) were successfully fitted by Galfit, 12 
(1.2%) ran into constraints, 6 (0.6%) fits crashed. Gim2d 
fitted 870 (87%) successfully, 46 (5%) ran into fitting 
constraints, 81 (8%) of the fits crashed. There are 4 
(0.4%) galaxies for which both codes failed. 

Crosses in Figure |8] represent galaxies that were fitted 
by both codes. Grey squares show galaxies that were fit- 
ted by that code only; the other code failed to return a 
useful result either through running into one of the fitting 
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Fig. 7. — This plot shows the same as Figure[5]but for Gim2d results (setups F, A and J in Table \7) . As is immediately obvious from 
these plots, Gim2d performs best when using the SEXTRACTOR background held fixed during the fit. 



constraints, or the fit crashed. The thick light-grey line 
and the thick dashed dark-grey line indicate the mean 
value and the 3(7 lines for different surface brightness 
bins of all galaxies that were fitted using that code. The 
left column shows fitting results using Galfit, the right 
column shows the results of the same set of simulations 
using Gim2d. The X-axis, showing the simulated surface 
brightness of the galaxies, is the same for all 6 plots. The 
3 rows show the results for magnitude (plots a and b), 
size (c and d) and Sersic index (e and f), respectively. 
The thin vertical line indicates the brightness of the sky 
background within the Gems survey. This is also roughly 
the limit up to which real galaxies are used for science 
within Gems. The Y-axes show deviations of the fit- 
ting values to the true parameter values, the horizontal 
thin line indicating the ideal value, which, in case of this 
galaxy sample, is simply the simulated value. 



We will discuss here and in all other sections the be- 
havior of the codes in 3 different surface brightness bins: 
firstly, the galaxies of highest surface brightness which 
one clearly would want to be fitted well with any code; 
secondly, galaxies within a surface brightness bin of 1 
magnitude around the surface brightness of the sky; and 
thirdly, the faintest galaxies, much fainter than the sky 
surface brightness. The third are the galaxies that are 
obviously hardest to fit. Here the results from the two 
codes differ the most from each other. 

To summarize our general findings, for n=l galaxies 
brighter than the sky's surface brightness, there is no 
significant mean offset between the input and recovered 
values in Figure [8J however, the scatter in the Gim2d 
results is somewhat larger. For this sample of galaxies, 
this would mean that the final results would be statisti- 
cally unaffected by one's choice of fitting code, but for 
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TABLE 7 

Gim2d: Fitting of n = 4 Simulations using Gim2d: Bright subsample with fn n < 22.5 and 

mag in < 22.5 



Setup 


N/Ntot 


Sersic 


: n 


^50 


ratio 


Amag 


e ratio 


A PA 


Quality 


mean 


17 


mean 


(7 


mean 




mean 




mean 


<7 


A 


164/540 


4.34 


0.61 


1.02 


0.13 


-0.04 


0.07 


1.01 


0.05 


0.00 


2.2 


7.02 


B 


168/533 


4.37 


0.43 


1.02 


0.09 


-0.03 


0.05 


1.01 


0.06 


-0.16 


1.5 


8.38 


C 


165/549 


4.32 


0.56 


1.01 


0.10 


-0.03 


0.07 


1.02 


0.06 


-0.10 


1.8 


6.31 


D 


161/533 


5.57 


1.49 


1.14 


0.22 


-0.14 


0.13 


1.02 


0.07 


-0.20 


2.0 


154.57 


E 


164/531 


4.84 


0.76 


1.11 


0.17 


-0.09 


0.08 


1.01 


0.06 


-0.17 


2.0 


44.42 


F 


165/539 


4.75 


0.67 


1.10 


0.17 


-0.08 


0.08 


1.01 


0.06 


0.05 


2.0 


35.63 


G 


162/539 


4.82 


0.75 


1.10 


0.17 


-0.08 


0.07 


1.01 


0.06 


0.34 


2.2 


42.02 


H 


161/545 


4.81 


0.76 


1.12 


0.16 


-0.09 


0.09 


1.01 


0.06 


-0.03 


1.6 


40.63 


1 


163/533 


5.68 


1.40 


1.27 


0.30 


-0.16 


0.14 


1.01 


0.06 


-0.03 


1.7 


177.28 


J 


167/551 


3.30 


0.49 


0.82 


0.14 


0.09 


0.09 


1.00 


0.05 


-0.18 


1.9 


30.40 


K 


168/546 


3.33 


0.51 


0.83 


0.13 


0.09 


0.09 


1.01 


0.05 


-0.01 


1.6 


28.32 



Note. — This Tabic summarizes the results from all Gim2d testing for bright galaxies; see Table [T] for 
results for faint galaxies. 

N/Ntot gives the numbers of galaxies N selected from the total sample of Ntot that Gim2d returns a result for 
each setup. The following columns give deviations (resistant mean values clipped at 3<r) from the simulated 
value and scatter for the 5 key fitting parameters. The a values given are values computed itcrativcly for all 
galaxies within 3 cr. 

The last column gives the fit quality as defined in Table [4"1 
Explanation of the setups: 

(A) SExtr. local bkg, initparam—no , Njcf — 100, 4ai so image sizes, best setup 

(B) SExtr. local bkg, initparam—no, Niqf — 100, 2ai so image sizes 

(C) SExtr. local bkg, initparam—yes, N\qf — 100, 4ai so image sizes 

(D) isoph. bkg, initparam—no, N\qf — 100, 6a.i so image sizes 

(E) isoph. bkg, initparam—no, Niqf — 25, and 4ai so image sizes 

(F) isoph. bkg, initparam—no, Njqf — 100, 4ai so image sizes 

(G) isoph. bkg, initparam—no,NicF — 400, and 4ai so image sizes 

(H) isoph. bkg, initparam—yes, Niqf — 100, 4a,i ao image sizes 

(I) bkg — 18.14, initparam—no, N\qf — 100, 4ai so image sizes 
(J) dobkg—yes, initparam—no, Niqf — 100, 4ai so image sizes 

(K) dobkg—yes, initparam—yes, Niqf — 100, 4a,i so image sizes, recommended setup 



TABLE 8 

Gim2d: Fitting of n = 4 Simulations using Gim2d: Faint subsample with 23.5 < //j n < 26.0 



Setup 


N/N to t 


Sersic 


: n 


»"50 


ratio 


Amag 


e ratio 


APA 


Quality 


mean 


<7 


mean 


(7 


mean 


a 


mean 


a 


mean 


a 


A 


151/540 


4.63 


1.45 


1.38 


0.78 


-0.20 


0.55 


1.05 


0.23 


-1.10 


7.4 


24.83 


B 


149/533 


4.52 


0.92 


1.44 


0.67 


-0.23 


0.43 


1.05 


0.21 


-0.13 


7.0 


17.35 


C 


154/549 


2.46 


0.86 


0.50 


0.23 


0.57 


0.48 


1.01 


0.19 


-0.64 


5.5 


149.08 


D 


147/533 


5.52 


1.36 


3.22 


1.93 


-0.76 


0.57 


1.00 


0.20 


-1.22 


10.6 


148.96 


E 


147/531 


5.05 


1.27 


1.89 


0.95 


-0.47 


0.49 


1.02 


0.16 


-1.17 


7.6 


69.38 


F 


149/539 


5.13 


1.26 


1.94 


0.93 


-0.47 


0.46 


1.03 


0.20 


-1.00 


6.3 


80.32 


G 


148/539 


5.24 


1.30 


2.36 


1.45 


-0.52 


0.51 


1.02 


0.16 


-1.43 


6.5 


97.78 


H 


153/545 


2.45 


0.92 


0.53 


0.27 


0.53 


0.50 


0.96 


0.16 


-0.88 


8.1 


150.72 


1 


146/533 


5.38 


1.19 


2.84 


1.51 


-0.73 


0.53 


0.99 


0.25 


-0.88 


7.0 


122.88 


J 


155/551 


1.79 


0.44 


0.28 


0.11 


1.08 


0.49 


1.02 


0.19 


-0.57 


9.2 


306.48 


K 


154/546 


1.78 


0.40 


0.28 


0.12 


1.10 


0.47 


1.04 


0.15 


-1.86 


7.5 


310.23 



individual objects the reliability of the Galfit results is 
slightly higher. 

For galaxies around the sky surface brightness, there 
are small systematic trends and increased scatter for our 
setup of Gim2d: a size ratio of 1.06 {r }itted /r simu i ated , 
a w 0.18) for Gim2d and a ratio of 1.02 for Galfit 
(a k. 0.08). This trend continues towards fainter surface 
brightness, although at no point does the systematic size 
offset exceed 20%. 

From Figure [8] (grey squares show objects fitted only 
by the respective code), one can easily see that Gal- 
fit returns a result more often than Gim2d, although 



the fraction of galaxies with failed fits is small in both 
cases. It is interesting to note that the properties of 
galaxies with failed fits is somewhat different between 
the two codes: galaxies not fit by Galfit (those fit only 
by Gim2d) are fainter than average, the parameters are 
discrepant even using Gim2d, whereas galaxies not fit 
by Gim2d (those fit only by Galfit) are fit almost as 
well by Galfit as other galaxies with the same surface 
brightness. 

4.1.2. Results of pure spheroid simulations 
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Fig. 8. — Fitting results for Galfit (left) and Gim2d (right) for the set of simulated n = 1 galaxies. The X-axis again shows the input 
surface brightness, the thin vertical grey line indicates the brightness of the sky background. The Y-axes are the same as in Figure [5] The 
input value is indicated as the horizontal thin dark-grey line, in this case representing a Sersic index of 1 for the sample of disk galaxies. 
The mean value of the deviations and a 3cr line are indicated for different surface brightness bins. The small crosses show the galaxies that 
where fitted 'successfully' by both codes. Grey squares indicate galaxies that where fitted by one code only and the other code did not 
return a meaningful result. The quality parameters as defined in Table|4]for this set of galaxies is 0.05/5.27 for Galfit (bright/faint) and 
1.01/251.67 for Gim2d showing that Galfit returns more reliable results for simulated disk galaxies in its optimal setup than Gim2d. 



Figure [9] shows the same plots as Figure |8j but for 
the simulated set of n = 4 profiles representing the 
light profile of a typical early-type galaxy. The to- 
tal number of galaxies recovered in this sample out of 
1600 simulated objects was 1091 (68%). Of these, only 
2 (0.2%) crashed in Galfit, 56 (5.1%) ran into con- 
straints; Gim2d crashed on 31 galaxies (2.8%), 36 (3.3%) 
additional fits ran into fitting constraints. 54 (5.0%) 
galaxies were fitted by Gim2d that were not fitted by 
Galfit, 63 (5.8%) galaxies were fitted by Galfit and 
not fitted by Gim2d, both codes crashed on 4 (0.4%) 
galaxies in common. When comparing Figure [5] to Fig- 
ure [51 one should be aware that the X-axis is shifted by 2 
magnitudes arcsec -2 towards fainter surface brightness, 
and that the dashed lines for clarity indicate la instead 
of 3cr as in Figure [8j It is clear that both codes recover 
the parameter values for n — 4 galaxies significantly less 
accurately than was the case for the n = 1 disks, result- 
ing in a substantially larger scatter. This is due to two 
different effects. Firstly, spheroidal profiles are in princi- 
ple harder to fit due to the importance of the outskirts 
of the light profile - this makes using an appropriate sky 
estimate much more important for a successful fit. Sec- 
ondly, due to the large amount of light in the faint wings 



of the galaxies, neighboring objects have a much bigger 
influence on the fit of the galaxy of interest than was 
the case for the exponential light profiles. This effect is 
particularly important for this simulated galaxy sample, 
because it was designed to have an unrealistically high 
number of large n — 4 galaxies. 

As was the case for disk galaxies, both codes are basi- 
cally indistinguishable for high surface brightness galax- 
ies in a statistical sense. For galaxies with surface bright- 
ness close to that of the sky, our implementation of Gal- 
fit recovers slightly better parameter values than Gim2d 
(size ratio of 1.00, a w 0.23 and a somewhat asymmetric 
error distribution for Galfit; size ratio of 1.14, a w 0.44 
and more asymmetric errors for Gim2d). The trend con- 
tinues towards lower surface brightness, with the Gim2d 
showing increasingly important systematic offsets and a 
substantially increased scatter. The directionality and 
asymmetry of the scatter in all plots (Gim2d and Gal- 
fit) are caused by neighboring contamination that is not 
fully removed, keeping in mind that 32% of the simulated 
galaxies escape detection by SExtractor because of 
their low surface brightness. 

4.1.3. Deblending effects 
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Fig. 9. — Code comparison for n = 4 galaxies. This figure is formatted in a similar way to Figure[8] but for the sample of simulated n = 4 
galaxies. The X-axis is shifted by 2 magnitudes arcsec - ^ compared to Fig[5] furthermore, in this plot the grey dashed line represent the la- 
limits. The value for a in this sample is around 3 times as large as was the case for disk galaxies. The true value of 4 for the Sersic index 
is again indicated as the horizontal thin line. The quality parameter for bright/faint galaxies is 0.06/2.92 for Galfit and 8.09/122.00 for 
Gim2d using this sample of galaxies. Numbers given here are different from the numbers given in Tables [7] and [l] as a different, larger sample 
of galaxies was used for this analysis. From fewer galaxies in Table [T] the best setup A there returns a slightly higher mean Sersic index 
which translates into a higher quality parameter. The numbers show clearly that Galfit returns better results than Gim2d. Interestingly, 
the quality parameter for Gim2d is smaller for faint spheroids than for faint disks, suggesting that these galaxies are fit more reliably. This 
might be due to the way the quality parameter was calculated in detail (using resistant mean offset values cut at 3 <r which removes large 
scatter, indeed the systematic offset is smaller in case of spheroidal galaxies, the scatter does not go into the quality number) but does not 
reflect the plots, the disk plots look better due to smaller scatter. 



Given the significant differences in philosophy when 
it comes to the deblending techniques between Galfit 
(multiobject fitting & masking) and Gim2d (masking 
only), we explore the recovery of input parameters as 
a function of the immediate environment of a galaxy for 
both codes. We analyze the subset of 390 (out of a to- 
tal of 1033) n — 4 simulated galaxies where GALAPAGOS 
decided that Galfit needed to simultaneously fit two or 
more profiles. This has the advantage that only signif- 
icant neighbors are included in this analysis and should 
be sufficient to demonstrate the magnitude of the influ- 
ence of deblending on the quality of galaxy fitting with 
Galfit and Gim2d. 

The results are summarized in Figure I10[ showing 
the difference between recovered and input magnitude 
as a function of the distance to the next neighbor 
(left) and as a function of the brightness of this neigh- 
bor (right). Fitting neighboring objects simultaneously, 
Galfit (panels a and b) is able to deblend these galaxies 
reliably and the deviations of the fitting magnitudes is 
independent of both distance and brightness of the clos- 



est neighbor. For Gim2d (panels c and d), it is clear 
that fitting residual is a strong function of both distance 
and brightness of the nearest neighbor. The closer and 
brighter a neighboring object is, the larger is the mag- 
nitude deviation. In an attempt to disentangle the in- 
fluence of distance and brightness, we try to correct for 
the systematics observed in panels c and d by removing 
the offsets and the slope, showing the results in panels 
e and f. It is clear that distance and brightness effects 
of nearest neighbor cannot be easily corrected, thus can 
significantly impact the performance of Gim2d in recov- 
ering the true parameters for simulated n — 4 galaxies. 
For isolated galaxies, Gim2d does an excellent job of re- 
covering the properties of n = 4 galaxies. 

4.2. Results of Simulations representing simulated 
GEMS tiles 

Bearing in mind the importance of neighboring galax- 
ies in determining the quality of fit, we repeated the 
above analysis using a sample of galaxies where n = 1 
and n = 4 galaxies were intermixed with realistic clus- 
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Fig. 10. — The impact of neighboring galaxies on fit results with Galfit (upper row) and Gim2d (middle and lower row). The left 
column shows magnitude deviations from the simulated values as a function of the distance to the next neighbor; the right column shows 
the difference between the recovered and simulated values as a function of the brightness of the nearest neighbor. Gim2d shows strong 
systematic offsets as a function of both distance to the nearest neighbor and its brightness. In the lower two panels, we try to correct for 
the systematics observed in panels c and d by showing the distance dependence of the offset- magnitude relation residuals (panel e), and 
the magnitude dependence of the offset-distance relation residuals (panel f). 



tering, sizes and magnitudes. Towards this goal, simula- 
tions were produced from the Galfit results of two real 
Gems tiles using recovered values of magnitude, position 
and size. The only parameter that was changed was the 
Sersic index. Every galaxy with a real Sersic index of 
2.5 or smaller was simulated with a Sersic index of 1; all 
others with a Sersic index of 4. These simulations have 
the advantage that they are better able to estimate the 
uncertainties of galaxy fits with Gems data. 

The results are shown in Figures QT] and [T3J It is 
worth noting that the range in galaxy surface bright- 
ness is much smaller in these simulations, although we 
have left the X-Axis the same as in the previous plots 
to facilitate comparison with these. We also show the 
surface brightness histograms of galaxies used in Bar- 
den et al. (2005; disk galaxies) and Mcintosh et al. 
(2005; spheroidal galaxies), to show which areas of pa- 
rameter space are especially important for scientific anal- 
ysis of data. Inspecting Figs. [11] and [T^l it becomes 
clear that Gim2d and Galfit perform more similarly for 
galaxy populations with clustering and properties typi- 
cal of medium-depth cosmological HST surveys than for 
purely simulated data. Galfit shows increased scatter 
and mild systematic offsets compared to the earlier sim- 
ulations. In the case of the n — 1 galaxies the difference 



in behavior is especially pronounced: it is clear that the 
presence of realistically clustered n — 4 galaxies around 
n=l galaxies is a larger source of random error in galaxy 
fitting for both Galfit and Gim2d than in pure n=l 
simulations. Gim2d shows very similar behavior com- 
pared to the earlier simulations, with still larger scatter 
and systematic offsets than Galfit. 

4.3. Results of deep-shallow tests using GOODS and 
GEMS data 

Simulations have the disadvantage that the galaxies 
have unrealistically simple structure and light-profiles 
that are known a priori to be the same as the profiles 
used for fitting. Accordingly, in this section we test the 
performance of the codes on real galaxies. This goal is 
not straightforward to achieve, inasmuch as one does not 
know what the real parameters of a given galaxy are, or 
indeed whether or not real galaxies are well described 
by the Sersic light-profile that was used during our anal- 
ysis. Instead, we take an empirical approach and test 
whether the fitting results obtained for real galaxies are 
sensitive to the image depth by comparing fitting results 
from the same galaxies in the 1-orbit depth Gems survey 
and the overlapping 5-orbit depth GOODS survey. If they 
were sensitive to the image depth, it would show that the 
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Fig. 11. — Fit results from re-simulated Gems tiles (see i|4,2l l. disk galaxies. Same as Fig. [8] but for a sample of 2 Gems tiles that were 
re-simulated in order to create a more realistic distribution of galaxy parameters and object density (the results for the n = 4 galaxies in 
this sample are shown i n Figure 1 1211. In the upper panel we overplot the surface brightness histogram of the 5664 disk galaxies that where 
selected for analysis by Bardcn ct al. (2005) showing where fitting accuracy is especially important. 



Sersic profile is of limited applicability in describing the 
light profile of real galaxies. 

Inspection of Figure [13] shows clearly that both codes 
are reasonably self-consistent when fitting the same 
galaxies on images of different depth, i.e. neither Gal- 
fit nor Gim2d depends strongly on image depth. While 
robustness to image depth does not imply that the fit- 
ting results are necessarily correct, it does give confidence 
that issues such as low surface brightness disks miss- 
ing from shallow HST imaging, departures from Sersic 
profiles at fainter surface brightness levels, etc., do not 
appear to seriously compromise the reliability of fitting 
parameters in 1-orbit depth HST/ A CS data. 



4.4. Error estimations from GIM2D and GALFIT 

It is interesting to consider if the internal error esti- 
mates from Galfit and Gim2d are reasonable reflec- 
tions of the more realistic uncertainties given by how 
well the codes recover input parameters for simulated 
galaxies. In Fig. [HI we address this issue by exploring 
the distribution of the error estimate a divided by the 
deviation of the fit result from the true value A. One 
can see a strong peak of values with er/A <C 1, i.e., for 
these galaxies the deviation A is much larger than the 



error estimate ct 18 . Under the assumption that the error 
estimates are correct, er/A should be >1 for 68% of the 
galaxies. Fig. [H] shows that cr/A > 1 for much less than 
50% of the cases; i.e., both Galfit and Gim2d substan- 
tially underestimate the true fit uncertainties, clearly in- 
dicating that the dominant contribution to fitting uncer- 
tainty is not shot and read noise; instead contamination 
by neighbors, structure in the sky, correlated pixels, pro- 
file mismatch, etc., dominate the errors. Fig. [T4l shows 
no difference between the histograms of cr/A for Galfit 
and Gim2d; i.e., Galfit and Gim2d both underestimate 
the errors to a similar extent. Accordingly, in this work 
and all other Gems works, we have not used the error 
estimates given by Galfit or Gim2d on an object-by- 
object basis, relying instead on the mean and width of 
the parameter distributions from Figs. [Tl] and [9] at the 
surface brightness of the galaxy in question. 

The uncertainties given in Table [S] are calculated and 
interpolated by using the surface brightness /j, and the 
Sersic index n and the results from the simulated data 
(see 3S]for details about this procedure). 

4.5. Further considerations 

18 This behavior was the motivation for plotting <r/A instead of 
the more intuitive quantity A/cr. 
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Fig. 12. — Results from re-simulated Gems tiles, n = 4 galaxies. Same as Fig. [9] but for the n = 4 galaxies in the re-simulated Gems 
tiles. The histo gram in the uppe r panel shows the surface brightness distribution of the 929 red-sequence galaxies that were selected for 
analysis in Mcintosh ct al. (2005]). 



I n the course of our prep aration of lBarden et alj (2005) 
and lMdntosh et all (|2005l ). we found that there were two 
additional practical considerations that potential users of 
Galfit and Gim2d may wish to consider. 

• Galfit is substantially less CPU intensive than 
Gim2d, reducing the cost and time of fitting large 
datasets. 

• Gim2d, at least in our implementation, failed to 
return a fitting result reasonably frequently, requir- 
ing manual intervention to restart the code. When 
trying to fit large datasets, we found this to be 
labor-intensive. In contrast, Galfit treated each 
fit as an individual task and therefore was run from 
shell scripts one fit after the other; if Galfit does 
not return a fitting result, the script automatically 
starts the next fit, requiring no interaction from the 
user. 

5. COMPARISON WITH PIGNATELLI ET AL. 
(2006) 

In this paper we present an extensive and thorough 
test of the two different 2-D galaxy fitting codes Galfit 
and Gim2d. In t his se ction, we compare our results with 
IPignatelli et al.l (|2006l ). who compared results from test- 
ing these two codes with their own 1-dimensional profile 
fitter, Gasphot. 



Pignatelli et al. conclude that Gasphot per- 
formed substantially better for significantly (realisti- 
cally) blended objects than either Galfit or Gim2d. In 
the course of our testing, we fo und a number o f differ - 
ences between our analysis and IPignatelli et ail (|2006h . 
which we felt may significantly affect their conclusions. 

• For the simulations examined in this paper, they 
used the IRAF task mkobject, which, as we de- 
scribed in [J2j is inexact for the inner pixels of a 
simulated galaxy light profile. According to ini- 
tial tests, the differences in the profiles can lead 
to systematic errors of up to 10-20% in the fitting 
parameters using Galfit due to the lack of over- 
sampling of the inner pixels when using at least 
our settings of artdata parameters, and depending 
on the exact profile parameters. As we used our 
own simulation script in IDL, improving the pro- 
file from IRAF might be possible by using different 
parameters settings. 

• In their paper, Pignatelli et al. allow the sky value 
to be fitted as a free parameter for all three codes. 
We argue in this paper that this is a non-optimal 
way to run any galaxy fitting code: not only would 
one be subject to errors from irregularities around a 
Sersic profile, but also the tests shown in this paper 
show that fitting the sky level as an additional pa- 
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Fig. 13. — Comparison of fits to deep vs. shallow images. The left column shows results using Galfit and the right column shows 
results for Gim2d. Both codes were run on the same sample of real galaxies. Note that in this plot the X-axis shows the Galfit surface 
brightness derived from the deeper GOODS data. The Y-axis shows the deviations of the three key parameters of the galaxies between the 
'deep' and the 'shallow' fit. In the uppermost plots we again overplott e d the histograms of the disk galaxy sample of Bardcn ct al. (2005, 
left histogram) and the spheroid-dominated sample of Mcintosh ct al. (2005;, right histogram), right histogram) to highlight out the area 
of parameter space where fitting and independence of the image depth is particularly important. 



rameter leads to significantly worse fits (especially 
in the case of Gim2d). Estimating a value for the 
sky before running the fitting codes and keeping 
this value fixed returns more accurate galaxy pa- 
rameter values. 

• Pignatelli et al. state that all automatic tools are 
likely to have problems with blended objects. Like 
them, wc find that deblending is necessary when 
setting up fitting routines. Masking out blended 
objects, while better than doing nothing at all, 
still leads to significantly biased results: this ap- 
pears to lie at the root of Gim2d's difficulties in 
fitting some simulations (Fig. ITU)) . We find, fur- 
thermore, that if one fits multiple galaxies simulta- 
neously (as is recommended when using Galfit), 
Galfit returns stable unbiased galaxy parameters, 
even in strongly-blended cases (and in cases with 
intermixed n = 1 and n = 4 galaxies in which n = 4 
galaxies play an impo rtant role and which w as not 
tested in the paper bv lPignatelli et all (|2006t ); com- 
pare to § 14.21 in this work) . Their argument that 
Galfit does not deal well with blended galaxies 
is an artefact of the mode in which they chose to 
use Galfit, in particular the lack of simultaneous 



fitting of neighboring objects. 

• Pignatelli et al. only show the Gim2d results 
for n=4 galaxies; according to our tests these 
are the hardest galaxies to reliably fit, and show- 
ing only those galaxies leads to a false impres- 
sion of the frequency and severity of Gim2d's dif- 
ficul ties with nearby neig hbors. Also, it seems 
that lPignatelli et alT(|2006f l have used the standard 
setup for Gim2d, which, according to our tests, be- 
haves poorly for faint galaxies: the influence of this 
decision on their fitting results is unknown. 

6. GEMS GALFIT RESULTS 

In this section, we present the Galfit F850LP-band 
fitting results of all 41,495 Gems objects that were found 
by SExtractor. We include fit results for all unique ob- 
jects, be they stars or galaxies. Some objects appear on 
two or more Gems frames; in this case the fit results for 
the images lying furthest from the frame edge was taken. 
Table[9]shows the 10 first objects in the catalog and gives 
the format of the catalog published in the online version 
of this paper. It includes the following values: 

1. RA (1), Dec (2): RA and DEC, given by SEx- 
tractor(J2000). 
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Fig. 14. — A comparison of estimated and real errors for two simulated samples of disk (n = 1) galaxies and spheroidal (n = 4) galaxies. 
Shown is the histogram of the errorbars <r devided by the deviation from the input value A. We show <r/A instead of the more intuitive 
quantity A/a, which would show a very wide distribution and effects are not as obvious as here. Calculated as rr/A, in principle for 68% 
of all galaxies this value should be >1. The number given in each plot shows the fraction of galaxies for which this is true. One can 
easily see that magnitude and Sersic index errorbars are dramatically underestimated by both Galfit and Gim2d; the r e uncertainties are 
significantly better estimated. 



2. tile (3): the Gems tile in which the galaxies ap- 
pears 

3. Snum (4): the SExtractor catalog number of 
this object 

4. GemsID (5): the identification of the galaxy within 
the Gems project. 

5. PosX (6), PosY (7): the position [pixels] of the 
galaxy in this given Gems tile. 

6. sky (8): The background pedestal as returned by 
Galapagos and used during the fit with Galfit. 

7. The Galfit results (9-13): magnitude, halflight- 
radius r e , Sersic index n, Axis Ratio b/a, and 
position angle (both with respect to the image, 
PAi m , and with respect to the WCS, defined north- 
to-east, PAwcs) as well as their 'uncertainties'. 
These uncertainties are not the errorbars returned 
by Galfit; as shown in §4.41 these errorbars do 
not reflect the true uncertainty of the fit. We 
use a statistical method to derive the error es- 
timates from our simulations. We first estimate 
from our simulations the scatter of the distribu- 
tion (of the n = 1 and n — 4 galaxy sample, 



respectively) at the given surface brightness [i of 
the real object for n = 1 and n = 4 simulations. 
Then, we perform a linear interpolation between 
the a(n = 1,/i = ^ bs) and <r(n = 4, /i = /i /, s ) to 
estimate a(n = n b s , /i = fi bs)- We do not extrap- 
olate; galaxies with n < 1 are given the value of 
the n = 1 sample, n > 4 galaxies the value of the 
n = 4 sample. We further adopt a minimum un- 
certainty for each fitting parameter (0.01 mag for 
mag, 0.01 pixels for r e , 0.01 for n, 0.001 for b/a and 
0.1 deg for PA). In the table published online and 
on the Gems webpage, the uncertainties are stored 
in extra columns. 

8- fcon (14): A flag showing which fits ran into any of 
the fitting constraints (0: fit ran into constraint, 1: 
fit did not run into any of the constraints). 

9. fsd (15): A flag showing which galaxies would be 
selected according to the selection criteria given in 
iBarden et al.1 (|2005l ) (0: object would not be se- 
lected for analysis, 1: object would be selected for 
analysis). The primary effect of the selection is to 
discard stars and very low surface brightness ob- 
jects. 
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As is clear from Fig. [TBI and [2l the catalog has strongly 
varying completeness, primarily as a function of surface 
brightness. Many applications of the Gems catalogs re- 
quire a good understanding of the se completeness proper - 
ties. In lBarden et alJ ()2005l ) and lMcIntosh et all (|2005f ). 
we used the simulations presented in this paper to quan- 
tify the effects of completeness. Accordingly, we have 
made extensive suites of simulation catalogs available to 
interested users on the Gems webpage to allow detailed 
examination of systematic errors in fitting and sample 
completeness. These issues are discussed in substant ially 
more detail inlRix et all (|2004h . lBarden et alJ <|2005f ) and 
iMdntosh etahl (pOOl . 

Figure [TTjl shows the parameter distribution of the sub- 
set of 34,638 objects for which the fit did not run into fit- 
ting constraints. Galaxies plotted in black and indicated 
by the contours would pass the selection in lBarden et al.1 
((20051) . One can see that galaxies discarded (plotted 
in grey) are mostly faint, low surface brightness galax- 
ies. Another important class of objects thrown out of the 
sample are objects with either very small sizes or rela- 
tively small sizes at high magnitudes. These are identi- 
fied as stars (o r saturated sta r s) by the automated selec- 
tion criteria in iBarden et al.l (|2005h . Although all these 
objects are still included in Table [§J one should be very 
careful when using their fitting results. All these galaxies 
are indicated by f SC i = 0. 

Figure [TBI shows histograms of the most important pa- 
rameters (surface brightness /i, apparent magnitude, ap- 
parent size r e and Sersic index n) for the subset of 23,187 
ob jects that wou l d be s elected according to the cuts given 
in lBarden et"aH (|2005f ). 

7. CONCLUSIONS 

In this paper, we have tuned and tested two paramet- 
ric galaxy fitting codes - Galfit and Gim2d - for fit- 
ting single Sersic light profiles to both simulated and real 
data. Our conclusions are the following: 

• The performance of both Galfit and Gim2d is a 
strong function of how the codes are set up; in par- 
ticular, studies using different setups of paramet- 
ric fitting codes may arrive at different conclusions 
about those codes if not properly or optimally used. 

• The recommended setup of Gim2d, using 
'dobkg'^yes' and 1 initparams='yes' is unable 
to recover the input parameter values of simulated 
n = 4 galaxies that were fainter than the sky 
surface brightness. We strongly discourage users 
from using these settings, but to instead fix the 
background to the value local of each galaxy as 
given by SExtractor and to input very wide 
model parameter limits. This is very important 
if the SExtractor segmentation map does not 
represent the true extent of a galaxy, as was the 
case for galaxies below the sky surface brightness 
when using standard SExtractor configurations. 

• Both codes are able to fit (at least bright) n = 1 
galaxies reasonably well with relatively little bias 
(Figure [8]) . Concentrated n — 4 galaxies are sub- 
stantially less straightforward to fit, owing to the 
large amount of light in the faint outer parts of the 
galaxies. For bright n — 4 galaxies, the behavior of 



Galfit is better than that of Gim2d (Figure [9]); 
however, parameters returned by Gim2d are still 
not significantly biased. For galaxy populations 
and clustering typical of medium-depth cosmologi- 
cal HST surveys, there are no large differences be- 
tween results obtained using Gim2d and Galfit 
for these bright galaxies. For fainter galaxies, the 
performance of Galfit is substantially better than 
that of Gim2d. In the set of realistically mixed 
simulations (Figure [TTJ and [T2"[) of n = 1 and n = 4 
galaxies, representing re-simulated Gems tiles, one 
can see significantly different behavior of the two 
codes, especially in the recovery of the Sersic index. 
Gim2d results are systematically biased to higher 
Sersic indices, which, in automated galaxy classifi- 
cation using the Sersic index to distinguish early- 
from late- type galaxies, would lead to systematic 
misclassification of a subsample of (faint) galaxies. 

• The errorbars given by both codes underestimate 
the true uncertainty of the fit by a large factor. 
One has to use a different approach to derive more 
realistic errorbars. 

• Our testing demonstrated that how a code treats 
neighboring galaxies can be of great importance. 
Gim2d only masks out neighbors, which in the 
tests we ran could lead to poor fitting results for 
strongly blended objects. Galfit, in contrast, is 
able to simultaneously fit many objects, and when 
used in that mode seems to be relatively robust to 
contamination by neighbors. For this reason, we 
caution users interested in strongly clustered galax- 
ies against using Gim2d without extensive prior 
testing. 

• Both Galfit and Gim2d are self-consistent and 
show no discernable dependence on image depth 
when comparing fitting results from Gems and 
Goods data. 

• Our tests on deep and shallow data show that real 
galaxies are indeed reasonably well described by 
general Sersic light profiles. 

• Galfit works best using an isophotal sky value 
given by GALAPAGOS. If this is not possible, using 
Galfit to internally derive a sky value is signifi- 
cantly better than fixing the sky to a local value 
given by SExtractor. 

• On the balance, we would tend to recommend Gal- 
fit for single Sersic profile fitting in medium-depth 
HST/ ACS data, as Galfit results are not only 
somewhat more reliable in the mean, but also have 
lower scatter and less sensitivity to contamination 
by neighbors than Gim2d. 
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Fig. 15. — This figure shows histograms for 23,187 objects from t he Gems survey that were fitted without running into constraints when 
fitted by Galfit and selected according to the selection criteria in Barden ct al. (2005). From left to right we show surface brightness fi, 
apparent magnitudes, apparent sizes r e (logarithmic scale) and Sersic index n. For comparison we overplotted the surface brightness of the 
sky as a vertical line in the leftmost plot and the cut of n=2.5 in the rightmost plot, which is frequently used to distinguish between disk- 
and bulge-dominated galaxies in an automated fashion. 
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Fig. 16. — Here we show parameter distributions for all galaxies in the catalog published in Table [9] excluding the ones where the fit 
ra n into any of the fitt ing constraints. Galaxies plotted in grey do not end up in the science sample according to the selection criteria used 
in IBarden et alj d2005l) . Galaxies plotted in black and indicated by the contours pass this selection. In both plots one can see that the 
galaxies thrown out are mostly faint low surface brightness galaxies. 
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Galfit fitting results for all Gems galaxies 
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5.36±1.02 


0.558±0.054 


-60.0(-62.2)±5.8 


1 





53.370589 


-28.062339 


s9z01A 


10 


GEMSJ033328.94-280344.4 


5.61 


341.22 


18.426 


23.22±0.12 


19.79±0.14 


0.66±0.29 


0.302±0.064 


-77.1(-79.3)±5.1 


1 





53.342879 


-28.061422 


s9z01A 


11 


GEMSJ033322.29-280341.1 


2940.04 


444.77 


18.098 


23.05±0.11 


14.03±0.13 


1.00±0.28 


0.619±0.063 


-38.8(-41.1)±5.1 


1 


1 


53.342659 


-28.060898 


s9z01A 


12 


GEMSJ033322.24-280339.2 


2963.58 


507.56 


18.171 


20.05±0.01 


0.30±0.17 


1.17±0.92 


0.140±0.001 


52.4( 50.1)±0.4 


1 





53.347315 


-28.061235 


s9z01A 


13 


GEMSJ033323.36-280340.4 


2470.43 


468.25 


18.223 


22.96±0.08 


6.30±0.10 


1.18±0.27 


0.804±0.051 


57.5( 55.2)±4.1 


1 


1 


53.341322 


-28.059825 


s9z01A 


11 


GEMSJ033321.92-280335.4 


3105.38 


636.05 


18.144 


21.56±0.10 


43.40±0.12 


0.45±0.27 


0.123±0.059 


50.7( 48.4)±4.7 


1 


1 


Note. — This 


tabic shows the 


first 10 objects of the 


Gems fitting results published in the c 




. of this papi 


3T. For every object that we 


ia found by SExtractor we give RA, DEC, the Gems til 


e name, the SEXTRACTOR 


number 


of that 



object, the GemsID (containing RA and DEC) as well as the X and Y position on the GEMS tile, so that identification of objects is easily possible. Furthermore we give the isophotal sky value that was used during the fit and the fitting 
results of the 5 key parameters mag (in apparent F850LP-band magnitudes), r e (in pixels of 0.03"), Sersic index n, axis ratio b/a and position angle PA in respect to the image (counted counterclockwise from vertical line according 
to the convention within GALFIT) and the world coordinate system (in brackets, north through east). Uncertainty estimates are derived from the simulations, following ^4.4| The last two columns show two different flags showing which 
fits ran into fitting constraints (fcon — 0) and which ones would make it into the final galaxy sample used for science according to the selection criteria given in Barden et al. i2005) (f sc i = 1)- 
The complete version of this table is in the electronic edition of the Journal. The printed edition contains only a sample. 



